23 research outputs found

    Practical algorithms for selection on coarse-grained parallel computers

    Full text link

    Efficient Evaluation of Sparse Data Cubes

    Full text link
    Available at www.springerlink.com. Computing data cubes requires the aggregation of measures over arbitrary combinations of dimensions in a data set. Efficient data cube evaluation remains challenging because of the potentially very large sizes of input data sets (e.g., in the data warehousing context), the well-known curse of dimensionality, and the complexity of the queries that must be supported. This paper proposes a new dynamic data structure called SST (Sparse Statistics Trees) and a novel, interactive, and fast cube evaluation algorithm called CUPS (Cubing by Pruning SST), which is especially well suited to computing aggregates in cubes whose data sets are sparse. SST stores only the aggregations of non-empty cube cells rather than the detailed records. Furthermore, it retains in memory the dense cubes (a.k.a. iceberg cubes) whose aggregate values are above a threshold; sparse cubes are stored on disk. This allows fast, approximate answers to queries. If users desire more refined answers, the related sparse cubes are aggregated. SST is incrementally maintainable, which makes CUPS suitable for data warehousing and for the analysis of streaming data. Experimental results demonstrate the excellent performance and good scalability of our approach.
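
    The following is a minimal illustrative sketch, in Python, of the core idea the abstract describes: aggregate only the dimension-value combinations that actually occur (so empty cells are never materialized), keep cells whose aggregate exceeds a threshold in memory as dense "iceberg" cells, and treat the rest as sparse cells that the real system would spill to disk. It is not the SST/CUPS implementation; the class and method names (SparseCubeSketch, add_record, query) are hypothetical.

```python
from collections import defaultdict
from itertools import combinations

class SparseCubeSketch:
    """Illustrative sketch of sparse cube aggregation (not the SST/CUPS code).

    Cells whose aggregate reaches `threshold` are considered dense and kept in
    `dense` (in memory); the remaining sparse cells would live on disk in the
    real system, here they simply stay in `sparse`.
    """

    def __init__(self, threshold):
        self.threshold = threshold
        self.dense = {}                   # cell -> aggregate (kept in memory)
        self.sparse = defaultdict(float)  # cell -> aggregate ("on disk")

    def add_record(self, dims, measure):
        # A cell is a frozenset of (dimension, value) pairs; update every
        # non-empty subset of the record's dimensions (the cuboid lattice).
        items = list(dims.items())
        for k in range(1, len(items) + 1):
            for combo in combinations(items, k):
                cell = frozenset(combo)
                total = self.dense.pop(cell, None)
                if total is None:
                    total = self.sparse.pop(cell, 0.0)
                total += measure
                # Promote to the in-memory (dense) store once above threshold.
                (self.dense if total >= self.threshold else self.sparse)[cell] = total

    def query(self, dims):
        cell = frozenset(dims.items())
        # Fast approximate path: answer from memory; a refined answer would
        # additionally aggregate the related sparse cells from disk.
        return self.dense.get(cell, self.sparse.get(cell, 0.0))


# Toy usage with a hypothetical sales-style schema.
cube = SparseCubeSketch(threshold=100.0)
cube.add_record({"region": "EU", "product": "laptop"}, measure=60.0)
cube.add_record({"region": "EU", "product": "phone"}, measure=50.0)
print(cube.query({"region": "EU"}))   # 110.0, promoted to the dense store
```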

    A scalable parallel subspace clustering algorithm for massive data sets

    No full text
    Clustering is a data mining problem that finds dense regions in a sparse multi-dimensional data set. The attribute values and ranges of these regions characterize the clusters. Clustering algorithms need to scale with the database size as well as with the high dimensionality of the data set. Further, these algorithms need to discover clusters embedded in subspaces of a high-dimensional space. However, the time complexity of exploring clusters in subspaces is exponential in the dimensionality of the data and is thus extremely compute intensive, so parallelization is the natural choice for discovering clusters in large data sets. In this paper we present a scalable parallel subspace clustering algorithm that embeds both data and task parallelism. We also formulate the technique of adaptive grids and present a truly unsupervised clustering algorithm requiring no user inputs. Our implementation shows near-linear speedups with negligible communication overhead. The use of adaptive grids yields a two-orders-of-magnitude improvement in the computation time of our serial algorithm over current methods, with much better clustering quality. Performance results on both real and synthetic data sets with a very large number of dimensions on a 16-node IBM SP2 demonstrate our algorithm to be a practical and scalable clustering technique.
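
    Below is a minimal, hypothetical Python sketch of the adaptive-grid idea mentioned in the abstract, not the paper's algorithm: partition one dimension with a fine histogram, merge adjacent bins with similar counts into variable-width windows, and keep only windows whose density exceeds a multiple of the uniform expectation; such dense 1-D intervals are the candidate units from which subspace clusters would be grown. The function and parameter names (adaptive_bins, fine_bins, density_factor) and the 20% merge tolerance are illustrative assumptions.

```python
import numpy as np

def adaptive_bins(values, fine_bins=100, density_factor=1.5):
    """Return (low, high) intervals of one dimension that are considered dense.

    Illustrative adaptive-grid construction; thresholds are assumptions,
    not values taken from the paper.
    """
    counts, edges = np.histogram(values, bins=fine_bins)

    # Merge adjacent fine bins with similar counts (within 20%) into windows,
    # so the grid adapts to the data distribution instead of being uniform.
    windows = [[0, 0, counts[0]]]          # [first_bin, last_bin, total_count]
    for i in range(1, fine_bins):
        first, last, total = windows[-1]
        avg = total / (last - first + 1)
        similar = (avg == 0 and counts[i] == 0) or \
                  (avg > 0 and abs(counts[i] - avg) <= 0.2 * avg)
        if similar:
            windows[-1] = [first, i, total + counts[i]]
        else:
            windows.append([i, i, counts[i]])

    # Keep windows denser than `density_factor` times the uniform expectation.
    n, span = len(values), edges[-1] - edges[0]
    dense = []
    for first, last, total in windows:
        width = edges[last + 1] - edges[first]
        if width > 0 and (total / width) > density_factor * (n / span):
            dense.append((edges[first], edges[last + 1]))
    return dense


# Toy usage: one dimension with two dense regions on a sparse uniform background.
rng = np.random.default_rng(0)
x = np.concatenate([rng.normal(2, 0.1, 500), rng.normal(8, 0.1, 500),
                    rng.uniform(0, 10, 100)])
print(adaptive_bins(x))   # dense intervals concentrated near x ~ 2 and x ~ 8
```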

    Driving Scientific Applications by Data in Distributed Environments

    No full text

    A Multidimensional OLAP Engine Implementation in Key-Value Database Systems

    No full text